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Abstract. String diagrams provide an intuitive language for expressing 
networks of interacting processes graphically. A discrete representation of 
string diagrams, called string graphs, allows for mechanised equational 
reasoning by double-pushout rewriting. However, one often wishes to 
express not just single equations, but entire families of equations between 
diagrams of arbitrary size. To do this we define a class of context-free 
grammars, called B-ESG grammars, that are suitable for defining entire 
families of string graphs, and crucially, of string graph rewrite rules. We 
show that the language-membership and match-enumeration problems 
are decidable for these grammars, and hence that there is an algorithm 
for rewriting string graphs according to B-ESG rewrite patterns. We also 
show that it is possible to reason at the level of grammars by providing 
a simple method for transforming a grammar by string graph rewriting, 
and showing admissibility of the induced B-ESG rewrite pattern. 


1 Introduction 

A string diagram (Fig. [^a)) consists of a collection of boxes (typically used to 
represent certain maps, processes, machines, etc.) connected by wires. 

They are essentially labelled directed graphs, but with one important differ¬ 
ence: wires, unlike edges, can be left open at one or both ends to form inputs 
and outputs, so they have an inherently compositional nature. Joyal and Street 
showed in 1991 that compositions of morphisms in any symmetric monoidal 
category can be represented using string diagrams |12| . and recently there has 
been much interest in applying string diagram-based techniques in a wide vari¬ 
ety of fields. In models of concurrency, they give an elegant presentation of Petri 
nets with boundary |20| . in computational linguistics, they are used to compute 
compositional semantics for sentences H], and in control theory, they represent 
signal-flow diagrams m- Equational reasoning for string diagrams has been 
used extensively in the program of categorical quantum mechanics |T], which 
provides elegant solutions to problems in quantum computation, information, 
and foundations using purely diagrammatic methods mm- 
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Fig. 1: A string diagram and its encoding as a string graph 


All of these applications make heavy use of proofs by diagram rewriting. These 
are proofs whereby some fixed set of string diagram equations are used to derive 
new equations by cutting out the LHS and gluing in the RHS. For example, the 
following string diagram equation: 



There are two things to note here. First, the LHS and RHS of string diagram 
equations always share a common boundary. In other words, we could think 
of the boundary as an invariant sub-diagram that embeds into the LHS and 
the RHS of the rule. Secondly, it is this invariant sub-diagram that is used to 
glue in the RHS once the LHS is removed. We can formalise this process using 
double-pushout (DPO) rewriting. 


We begin by representing (directed) string diagrams as certain labelled, (di¬ 
rected) graphs called string graphs (see Fig.Q. Wires are replaced by chains of 
edges containing special dummy vertices called wire-vertices. By contrast, the 
‘real’ vertices, labelled here f,g and h are called node-vertices. String graphs— 
originally introduced under the name ‘open graphs’ in uni —have the advantage 
that they are purely combinatoric objects, as opposed to the geometric objects 
like string diagrams. As such, they form a suitable category for performing DPO 
rewriting. The DPO diagram associated with the rewrite (1.21 is given in Fig¬ 
ure]^ where the top row is the string graph rewrite rule for (1.1), the left square is 























Fig. 2: DPO example 


the pushout complement removing the LHS, and the right square is the pushout 
gluing in the RHS. 

This technique has been used to mechanise proofs involving string diagrams, 
and forms the foundation of the diagrammatic proof assistant Quantomatic m- 
However, proving equations between single string diagrams is just half the story. 
Typically, one wishes to prove properties about entire families of diagrams. For 
example, suppose we have a node that serves as an n-fold ‘copy’ process. Then, 
one might require, as in Fig. [^a), that connecting another node to the ‘copy’ 
node would result in n copies. Thus, we have an infinite family of (very similar) 
equations, one for each n. 



Fig. 3: An n-fold copy rule, and its formalisation using !-box notation. 


Such a family of equations can be easily captured using a graphical syntax called 
!-box notation. Here, we can indicate that a subgraph (along with its adjacent 
edges) on the LHS and RHS of a rule can be repeated any number of times 
by wrapping a box around it, as in Fig. [^b). This rewrite pattern can then be 
instantiated to a concrete rewrite rule by fixing the number of copies of each 
!-box to retain. A formal description of this instantiation process is given in m- 
This procedure is straightforward to mechanise, and is the main mechanism 
Quantomatic uses for reasoning about string diagram families. However, the 
types of string graph languages representable using !-box notation is quite lim- 













ited. For example, the languages produced by string graphs with !-boxes always 
have finitely-bounded diameter and chromatic number Q Thus many naturally- 
occurring languages containing chains or cliques of arbitrary size are not possible. 

In |14| . we showed that the languages generated by string graphs with (non¬ 
overlapping) l-boxes can always be generated using a context-free vertex re¬ 
placement grammar. Thus, we conjectured that a general notion of a context- 
free grammar for string graphs and string graph rewrite rules could produce a 
more expressive language for reasoning about families of string diagrams. In this 
paper, we will show that this is indeed the case. 

We begin by defining a context-free grammar for string graphs which is built 
on the well-known B-edNCE class of grammars. We call these grammars bound¬ 
ary encoded string graph (B-ESG) grammars, where ‘boundary’ here means the 
grammar satisfies the same boundary condition as B-edNCE grammars. An en¬ 
coded string graph is a slight generalization of a string graph, where certain 
fixed subgraphs can be encoded as a single edge with a special label. A B-ESG 
grammar consists of a B-edNCE grammar for producing encoded string graphs, 
and a set of decoding rules for replacing these special edges. For example, this 
grammar: 





(1.3) 


produces a complete graph of n > 2 white node-vertices, connected by wires, 
which would not be possible in the !-box language. We define B-ESG rules anal¬ 
ogously as two grammars with identical non-terminals, and a 1-to-l correspon¬ 
dence between their productions. Consider, for example, the following rule: 



It will rewrite any complete graph of white node-vertices into a star with white 
node-vertices on every outgoing wire: 



^ For colourability and cliques in string diagrams, we treat chains of wire-vertices as 
single edges. 




















































In this paper, we will define a family of context-free grammars suitable for equa- 
tional reasoning with string diagrams. After giving formal definitions for string 
graphs in Section we will define encoded string diagrams and B-ESG gram¬ 
mars in Section as well as prove that a B-ESG grammar always yields a 
language consisting of well-formed string graphs. We also show that the B-ESG 
membership problem is decidable as well as the match enumeration problem. 
In Section]^ we will extend to B-ESG rewrite rules and show that these always 
form a language of well-formed string graph rewrite rules (i.e. the generated 
rules always have corresponding inputs/outputs). In Sectionwe give a simple 
example of meta-level reasoning with B-ESG grammars, whereby string graph 
rewrite rules can be lifted to admissible transformations of B-ESG grammars, 
thus proving entire families of diagram equations simultaneously. We then gener¬ 
alise this result to show how basic B-ESG on B-ESG rewriting is possible. In the 
conclusion, we discuss how these principles might be extended to more powerful 
versions of B-ESG on B-ESG rewriting and structural induction. 


2 Preliminaries 

Definition 1 (Graph [17] ). A graph over an alphabet of vertex labels E and 
an alphabet of edge labels F is a tuple H = (V, E, A), where E is a finite set of 
vertices, E C {(u, 7, ■u;)|u, w G V,v ^ w,j G E} is the set of edges and X : V ^ S 
is the node labelling function. The set of all graphs with labels A, E is denoted 

GRs,r- 

Note that this notion of graphs forbids self-loops. This is a standard (and con¬ 
venient) assumption in the vertex-replacement grammar literature. It will not 
get in our way, since we always use chains of wire-vertices to encode self-loops 
in string diagrams. Also, this notion of graph allows parallel edges only if they 
have different types. Again, this isn’t problematic for our use case, because string 
graphs cannot have parallel edges (but they do allow parallel wires, cf. Defini¬ 
tion 1^ . 


2.1 B-edNCE grammars 

We will focus on neighbourhood-controlled embedding (NGE) grammars. This is 
a type of graph grammar where non-terminal vertices are replaced by graphs ac¬ 
cording to a set of productions, each endowed with a set of connection instructions 
which determine how the new graph should be connected to the neighbourhood 
of the non-terminal. edNGE grammars are NGE grammars, with edge labels and 
directions, and B-edNGE grammars additionally impose the ‘boundary’ condi¬ 
tion |18| which guarantees confluence for applications of productions. They form 
an important subclass of all confluent edNGE grammars (G-edNGE grammars) 
that is particularly easy to characterise. 


Definition 2 (Graph with embedding |T7]). A graph with embedding over 
labels A, _r is a pair {H,C), where H is a, graph over S,r and C C S x 
r X r X Vh X {in^out'\. C is called a connection relation and its elements are 
called connection instructions. The set of all graphs with embedding over S, F 
is denoted by GRE^.r- 

Graph grammars operate by substituting a graph (with embedding) for a non¬ 
terminal vertex of another graph. Connection instructions are used to introduce 
edges connected to the new graph based on edges connected to the non-terminal. 
A connection instruction {a, P/'y,x,d) says to add an edge labelled 7 connected 
to the vertex x in the new graph, for every /3-labelled edge connecting a tr- 
labelled vertex to the non-terminal, d then indicates whether this rule applies 
to in-edges or out-edges of the non-terminal. For the formal definition of this 
substitution operation, see e.g. m- 

Definition 3 (edNCE Graph Grammar fl7j). An edNCE Graph Grammar 
is a tuple G = {E,A,r,f2,P,S), where E is the alphabet of vertex labels, 
A C A is the alphabet of terminal vertex labels, F is the alphabet of edge labels, 
17 C T is the alphabet of final edge labels, P is a finite set of productions and 
S G E — A is the initial nonterminal. Productions are of the form X -G {D, C), 
where X G E — A is a non-terminal vertex and (D, G) G GREs,r is a graph 
with embedding. 

A derivation in an edNCE grammar is a sequence of substitutions of non¬ 
terminals starting from the the graph which just contains the single starting 
non-terminal S. The set of all graphs (not containing non-terminals) isomorphic 
to some graph reachable in this manner is called the language of the grammar. 
edNCE grammars with the additional property that the order in which non¬ 
terminals are expanded is irrelevant are called confluent edNCE, or C-edNCE, 
grammars. We will focus on a special case: 

Definition 4 (B-edNCE grammar). A boundary edNCE, or B-edNCE, gram¬ 
mar is a grammar such that for all productions p : A —>■ {D,C), D contains no 
adjacent non-terminal nodes and C contains no connection instructions of the 
form {a, fl/"/,x,d) where cr is a non-terminal label. 


2.2 String graphs and rewriting 

Definition 5 (String Graph). For disjoint sets Af = {Nj, Ng,...}, W = 
{Wa,Wbt--}, a (directed) string graph is a directed graph labelled by the 
set Af U W, where vertices with labels in Af are called node-vertices and vertices 
with labels in W are called wire-vertices, and the following conditions hold: ( 1 ) 
there are no edges directly connecting two node-vertices ( 2 ) the in-degree of ev¬ 
ery wire-vertex is at most one and (3) the out-degree of every wire-vertex is at 
most one. 


The category SGraph has as its objects string graphs and its morphisms string 
graph homomorphisms (i.e. graph homomorphisms respecting labels). In full 
generality, string graphs also allow one to restrict which node-vertices can be 
connected to which wire-vertices, and allow for an ordering on in- and out-edges 
(e.g. for non-commutative maps), but for simplicity, we will consider the case 
where any node-vertex can be connected to any wire-vertex and the ordering is 
irrelevant. 

We will depict wire-vertices as small black dots and node-vertices as larger nodes 
of various shapes and colours. We can define undirected string graphs anal¬ 
ogously, by replacing the last two conditions with the requirement that each 
wire-vertex have degree at most 2. To avoid excessive duplication we will state 
all of our results for the directed case, but similar results carry through to the 
undirected case. Thus, we will occasionally give undirected examples when they 
are more convenient than their directed counterparts. 

Definition 6 (Inputs, Outputs and Boundary Vertices). A wire-vertex of 
a string graph G is called an input if it has no incoming edges. A wire-vertex 
with no outgoing edges is called an output. The boundary of G consists of all of 
its inputs and outputs. 

Wires in string diagrams are geometric in nature. They are encoded in string 
graphs as chains of wire-vertices. 

Definition 7. A wire is a maximal connected subgraph of a string graph con¬ 
sisting of only wire-vertices and at least one edge. There are three cases: (a) it 
forms a simple directed cycle, which is called a circle, (b) it is a chain where one 
or both endpoints are connected to node-vertices, which is called an attached 
wire, or (c) it is a chain not connected to any node-verties, which is called a bare 
wire. 



Fig. 4: Types of wires: (a) circle, (b) attached wire, (c) bare wire 


In particular, when considering embeddings of string diagrams into each other, 
wires are allowed to be sub-divided arbitrarily. To accommodate this behaviour 
with string graphs, we represent wires as chains of wire-vertices. Of course, this 
choice is not unique, as we can represent a single wire with a chain of wire-vertices 
of any length. Being isomorphic up to the number of wire-vertices representing 
each wire is a natural notion of ‘sameness’ for string diagrams, which is called 
wire-homeomorphism. 



Definition 8 (Wire-homeomorphie string graphs). Two string graphs G 
and G' are called wire-homeomorphic, written G ^ G" if G' can be obtained from 
G by either merging two adjacent wire-vertices (left) or by splitting a wire-vertex 
into two adjacent wire-vertices (right) any number of times: 


Two string graphs G ~ G' are "semantically" the same and only differ by 
the length of some of its wires. Note, that for any string graph G, its wire- 
homeomorphism class has a unique minimal representative, which can be ob¬ 
tained from G by just contracting wires as much as possible, so wire-homeomorphism 
is decidable. 

In order to rewrite a string graph using a string graph rewrite rule, one first finds 
a matching of the LHS. Note we say an edge is incident to a subgraph K C G ii 
it connects a vertex in iG to a vertex in G\K. 

Definition 9. Let L be a string graph with boundary B C L. Then a matching 
of L onto a string graph H is an injective string graph homomorphism m : L ^ H 
where H ^ H and the only edges incident to the image m{L) C H are also 
incident to m{B). 

In other words, a matching satisfies the ‘no dangling wires’ condition with respect 
to the boundary of L. Note that matching is done modulo wire-homeomorphism. 
This allows wires in the target graph to grow if necessary to injectively embed 
the pattern. In the example below, the embedding on the left fails, but the 
embedding into a wire-homeomorphic graph (right) succeeds: 

As usual, string graph rewrite rules are encoded as spans L B ^ R, where B 
is the common boundary of L and R. 



Definition 10. A rewrite of a string graph G by a rule L ^ B —>■ R using a 
matching m : L ^ G (for G ^ G) consists of a pushout complement (1) followed 
by a pushout (2) in SGraph: 

L - B - ^ R 

( 1 ) ( 2 ) 

r i 

G-- I - -H 

It was shown in m that for any matching of the LHS of a string graph rewrite 
rule, the pushout complement (1) and the pushout (2) exist and are unique, so 
DPO rewriting for string graph rewrite rules is well-defined. 











3 Encoded string graphs and B-ESG grammars 


We will introduce a type of context-free graph grammar suitable for defining 
families of string graphs, which is essentially a restriction of the class of B- 
edNCE grammars. However, to squeeze out a bit more expressive power, rather 
than using such grammars to generate string graphs themselves, we use them 
to generate encoded string graphs. An encoded string graph allows us to ‘fold’ 
some collection of fixed subgraphs into single edges, which will allow us more 
flexibility when it comes to the types of languages we can produce. 

Definition 11 (Encoded string graph). Let £ = {o, /3,...} be a finite set of 
encoding symbols. An encoded string graph is a string graph where we additionally 
allow edges labelled by encoding symbols a £ £ to connect pairs of node-vertices. 


Definition 12 (Decoding system). A decoding system T is a set of DPO 
rewrite rules of the form: 

^ C>A - (3.1) 

one for every triple {a, Ni, N 2 ) G £ x JV x Af, where the LHS consists of a 
single edge labeled a connecting an TVi-labelled node-vertex to an A^ 2 -labelled 
node-vertex, and the RHS is a connected string graph that contains no inputs, 
outputs, or encoding labels. 


Note, the invariant part of (3.11 consists of the two shared node-vertices. Thus, 
by construction T is confluent (since no two rules apply in the same location) 
and terminating (since no encoding labels occur in the RHS of a rule). Thus, 
decoding an encoded string graph consists of normalising with respect to T. 


Throughout the rest of the paper, we assume that all of our grammars use the 
same vertex and edge label alphabets, S and F respectively, and also the same 
initial non-terminal S. The alphabet for terminal vertex labels is Z\ := AfU W. 
We do not allow any non-final edge labels. We assume £ C F and any edge with 
label in £ will be called an encoding edge. 


Definition 13 (B-ESG grammar). A B-ESG grammar is a pair B = {G,T), 
where T is a decoding system and G = (A, A, F, F, P, S) is a B-edNCE grammar, 
such that for every production X -G {D, C) G P, the following conditions are 
satisfied: 


Nl: An edge carries an encoding label if and only if it connects a pair of node¬ 
vertices. 

N2: Any connection instruction of the form {N,a/P,x,d) where N is a node¬ 
vertex label and cc is a node-vertex, must have fi G £. 

Wl: Every wire-vertex in D has in-degree at most one and out-degree at most 


one. 




W2: There are no connection instructions of the form {a,a//3,x,d) where a is 
any vertex label and a; is a wire-vertex. 

W3: For W a wire-vertex label and each 7 and d, there is at most one connection 
instruction of the form (W,j/6,x,d) and we must have 5 ^ £. 

W4: Let y be a non-terminal vertex with label Y in D. If y is adjacent to a 
wire-vertex labelled W via an edge with direction d and label /? or there’s 
a connection instruction of the form (W, a/f3,y, d), then all productions Y 
must contain a connection instruction of the form (VF, j3/j, z, d). 

The conditions N1 and N2 guarantee that node-vertices never become directly 
connected by an edge, unless that edge has an encoding label. Wl-3 ensure that 
wires never ‘split’, i.e. wire-vertices always have at most one input or output. 
The final condition, which won’t be necessary until the next section, ensures that 
inputs stay inputs and outputs stay outputs in a sentential form throughout the 
course of the derivation. 

Definition 14 (B-ESG concrete derivation). A concrete derivation for a 
B-ESG grammar B = (G, T) with S the initial non-terminal for G, consists of 
a derivation S Hi in G, where Hi contains no non-terminals, followed by 

a decoding Hi H 2 . We will denote such a concrete derivation as S 

Hi H 2 or simply with S =>^ H 2 if the graph Hi is not relevant for the 

context. 

Note that in the above definition S refers to both the initial non-terminal label, 
but also to a graph with a single vertex with label S which is the starting graph 
for a derivation. As usual, the language of a B-ESG grammar B is given by 
L{B) ■.= {H\S H}. 

Theorem 1. Every graph in the language of a B-ESG grammar is a string 
graph. 

Proof. Let B = (G, T) be a B-ESG grammar. Decoding an encoded string graph 
will always produce a string graph, so it suffices to show that any derivation from 
G produces an encoded string graph. We call a sentential form an ESG-form if 
it is an encoded string graph, which possibly has some additional non-terminals 
that are either connected to node-vertices or are connected to wire-vertices in 
such a way that all wire-vertices have at most 1 in-edge and 1 out-edge. We 
show that any derivation starting from an ESG-form is an encoded string graph. 
This can be done by induction on the length of derivations. If the derivation is 
length 0 , the sentential form has no non-terminals, so it is an encoded string 
graph. Otherwise, consider a derivation of length n. After the first step, any 
newly-introduced wire-vertices will have in-degree and out-degree at most 1 by 
W1 and W2, whereas the degrees of any already existing wire-vertices will not 
increase by W3. N1 and N2 will ensure that any resulting node-vertices will 
only be connected by edges with encoding labels, so the result is an ESG-form. 
Thus we can apply the induction hythothesis. Noting that S is, in particular, an 
ESG-form completes the proof. □ 



Lemma 1. For every B-ESG grammar B = (G,T), there exists n € N, such 
that if H G L{B) then H does not contain a wire with size bigger than n. 

Proof. Let n be the length of the longest wire in the bodies of the productions 
in G and T, and consider an arbitrary sentential form obtained from S H'. 
From condition W2, we see that expanding any of the non-terminals cannot 
create a new edge between an already established wire-vertex and a newly created 
wire-vertex. Therefore, a concrete derivation in G will produce an encoded string 
graph with maximum length of any wire n. Then, while doing the decoding, T 
will only replace edges between node-vertices and therefore a wire longer than 
n cannot be established. □ 

Naturally, we want to be able to decide if a given string graph is in a B-ESG gram¬ 
mar. Since there should be no distinction between wire-homeomorphic string 
graphs, we state the membership problem as follows: 

Problem 1 (Membership). Given a string graph PI and a B-ESG grammar B, 
does there exist a string graph H H, such that H G L{B)1 In such a case, 
construct a derivation sequence S H. 

Theorem 2. The membership problem for B-ESG grammars is decidable. 

Proof. First, we show that exact membership (i.e. not up to wire-homeomorphism) 
is decidable. From Theorem[2 we know that any concrete B-ESG derivation pro¬ 
duces an encoded string graph which is then decoded to a string graph. Since 
the decoding sequence can only increase the size of a graph, we can limit the 
problem to considering all graphs of size smaller than H. However, there are 
finitely many graphs whose size is smaller than H. For each such graph H', we 
can then decide if H' G L{G) (this is the membership problem for B-edNGE 
grammars). Finally, we check if H' H which is also clearly decidable. If no 
such graph H' exists, then the answer is no and otherwise the answer is yes. 

We now generalise to the wire-homeomorphic case. There may be infinitely many 
string grapl^ H such that H ^ H, but by using Lemma it suffices to consider 
only those H which do not have wires longer than some fixed n S N. There are 
finitely many of these, so we can check if at least one of them is in L{B) as before. 
Finally, since derivation sequences are recursively enumerable, if H G L{B), we 
can also construct a concrete derivation sequence S H. □ 

In section]^ we will show how to use B-ESG grammars for rewriting. To do this, 
we must show that the grammar produces some graphs which can be matched 
onto a given string graph, and ideally that the set of all matchings is finite, so 
that we can enumerate all of the relevant equalities. This is not true in general, 
but it is true whenever the B-ESG grammar satisfies some simple conditions: 


Definition 15 (Match-exhaustive B-ESG grammar). We say that a B- 
ESG grammar B — {G,T) is match-exhaustive, if (1) any production X —>■ 
{D, C) which contains a bare wire in D has a finite bound on the number of 
times it can be expanded in any sentential form (2) no production contains an 
isolated wire-vertex in its body (3) there are no empty productions and (4) there 
are no productions consisting of a single node which is non-terminal. 

Note that condition (1) can be easily decided by examining all productions which 
could possibly lead back to themselves via non-terminals, and seeing if they 
contain any bare wires. In m it was furthermore shown that any grammar can 
be transformed into an equivalent grammar satisfying conditions (3) and (4). 

Problem 2 (Match-enumeration). Given a string graph H and a B-ESG grammar 
B, enumerate all of the B-ESG concrete derivations S K, such that there 
exists a matching m ■. K ^ H iox some H ^ H. 


Theorem 3. The match-enumeration problem for a B-ESG grammar B is de¬ 
cidable if S is a match-exhaustive grammar. 


Proof. Let W be the number of wires in a string graph P[. Then, for any P[ H, 
we know that H must have the same number of wires and node-vertices as H . 
However, the number of wire-vertices in P[ may be arbitrarily large. 


Gondition (1) of Definition 15 implies that there exists n S N, such that, for 
any K S L[B), K has at most n bare wires. Any matching of string graphs will 
map at most two non-bare wires onto a single wire. So, if K has a matching 
on H, then it can have at most 2W non-bare wires. Therefore, K can have at 
most 2W -+- n wires. From Lemma we know that the length of any such wire 
is bounded. Gondition (2) of Definition 15 implies that any wire-vertex in K is 
part of some wire, thus there is a bound on the number of wire-vertices in K 
and we already know the number of node-vertices is also bounded. Thus, there 
are finitely many K S L[B) which could possibly have a matching onto some 
iL ~ iL, so we can enumerate them. 


Finally, conditions (3) and (4) from Definition 15 imply that the sentential forms 
of B can only increase in size and therefore for any K satisfying the above 
conditions there are finitely many concrete derivations S K which we can 
enumerate. □ 


4 B-ESG rewrite patterns 

In the previous section we introduced a new type of grammar which generates 
string graphs and which has some important decidability properties. Thus, we 
can use a single B-ESG grammar to represent a single family of string graphs. 
However, we still have not described how to encode equalities between families 
of string graphs. This is the primary contribution of this section. We introduce 





the notions of B-ESG rewrite pattern and B-ESG pattern instantiation which 
show how this can be achieved in a formal way and then we give examples of 
important equalities which cannot be encoded using the !-graph formalism, but 
are expressible using B-ESG rewrite patterns. 

Definition 16 (B-ESG rewrite pattern). A B-ESG rewrite pattern is a pair 
of B-ESG grammars Bi = (Gi, T) and B 2 = (G 2 , T), where Gi = (A, A, E, E, Pi,S) 
and G 2 = {E, A, E, E, P 2 , S), such that there is a bijective correspondence be¬ 
tween the productions in Pi and P 2 given hy X ^ (Di,Gi) € Pi iff A —>■ 
{ 02 , 02 ) & P 2 and the corresponding pairs of productions satisfy the following 
conditions: 


NT: There is a bijection between the non-terminal nodes in Di and the non¬ 
terminal nodes in £>2 which also preserves their labels. 

10: There is a bijection between the inputs (resp. outputs) in Di and the inputs 
(resp. outputs) in 02 - 


Gondition NT ensures that we can perform identical derivation sequences on 
both grammars Gi and G 2 in the sense that we can apply the same order of 
productions to corresponding non-terminal vertices. This should become more 
clear from definition |17| When working with B-ESG rewrite patterns, without 
loss of generality, we will assume that the corresponding inputs/outputs/non¬ 
terminal nodes are identified between the two grammars by sharing the same 
name, instead of using a bijective function explicitly. 


Definition 17 (B-ESG pattern instantiation). Given a B-ESG rewrite pat¬ 
tern (Pi, P 2 ), a B-ESG pattern instantiation is given by a pair of concrete deriva- 


tions: 

S = 

~^vi,pi 

Hi = 

’^V2,P2 

H 2 

and 

S = 

^B2 

^Vi,pi 

H[ = 

^B2 

^V2,P2 

H '2 


v3ip3 


^B2 

v3,p3 


H„ 


>B2 

Vn,Pn. n 


F 


F' 


In other words, we use an identical derivation sequence in the two B-edNGE 
grammars to get two encoded string graphs, which are then uniquely decoded 
using the productions of T. These ideas are similar to the pair grammars ap¬ 
proach presented in m, but different in that we are using a more general notion 
of grammar, our extension to the context-free grammars is more limited and our 
focus is on string diagram reasoning rather than computer program representa¬ 
tion. These ideas have been further generalised in m- 


Theorem 4. Every B-ESG pattern instantiation is a string graph rewrite rule. 


Proof. Gonsider a concrete derivation as in Definition |17| From Theorem we 
know that F and F' are string graphs. We have to show that they have the same 
set of inputs/outputs. Note, that the decoding process using the productions 


of T cannot establish any new inputs or outputs and thus we can reduce the 
problem to showing that i?„ and H'^ have the same sets of inputs/outputs. 

Setting Hq := S := Hq, we can prove by induction that any pair of sentential 
forms has the same set of inputs. This is trivially true for {Hq,Hq). 

Assuming that {Hk, H'f.) have the same set of inputs, observe that {Hk+i, 
is obtained by applying a corresponding pair of productions to These 

productions satisfy all of the listed conditions in Definition [T^ and Definition [T6| 
In particular, conditions W3 and W4 guarantee that the in-degree of all wire- 
vertices in both Hk and H^. won’t be affected. Condition W2 implies that the 
newly created wire-vertices in Hk+i and are not connected to any of the 

previously established vertices in Hk and respectively. Combining this with 
condition 10 ensures that any newly created wire-vertices are inputs in Hk+i iff 
they are inputs in Thus, Hk+i and have the same set of inputs. In 

particular, and H'^ have the same inputs. By symmetry, H^ and H'^ have 
the same output vertices. □ 


As mentioned in the introduction, we now gain previously non-existent expressive 
power for families of string graph rewrite rules. For instance, we can write a rule 
that merges a chain of white node-vertices, each with 1 input, into a single white 
node-vertex with n inputs as follows: 



In the introduction, we also gave an example of a rule involving clique-like graphs. 
In fact, a minor modification to (1.4) yields a rule that is directly relevant for 
quantum computation. The following rule, called local complementation: 



is crucial to establishing the completeness of a rewrite system called the ZX- 
calculus, which has been used extensively for reasoning about quantum circuits 
and measurement-based quantum computation. This rule uses a non-trivial de¬ 
coding system, where the /i-labelled edges are interpreted as wires containing a 
single node-vertex. Here we are using to mean ‘any node-vertex label’. The 
node-vertices labelled represent quantum phase gates, whereas the H is & 

Hadamard gate. The interested reader can find a detailed description of the ZX- 
calculus in e.g. [7] the completeness theorem based on local complementation 



































































5 Transforming B-ESG grammars 


In the previous section, we showed how a B-ESG rewrite pattern can encode 
(infinitely) many equalities between string diagrams, where we represent the 
equalities as string graph rewrite rules. Thus, we can use B-ESG rewrite patterns 
to encode axioms and axiom schemas of a string diagram theory. However, in 
order to perform equational reasoning between families of string diagrams, we 
need to show how we can use such axioms in our formalism in order to obtain 
new equalities. This is the primary contribution of this section. In particular, we 
show how we can transform a B-ESG rewrite pattern into another one using a 
string graph rewrite rule in an admissible way. Finally, we generalise this result 
by showing how we can transform B-ESG rewrite patterns using another B-ESG 
rewrite pattern in an admissible way. 

Definition 18 (Final subgraph). Given a production X —> {D,C) in a B- 
edNGE grammar, we say that a subgraph S' of D is final if S is a string graph 
which is not adjacent to any non-terminals in D and no vertex of S has associated 
connection instructions in C. 

Definition 19 (B-ESG transformation step). We say that a given grammar 
B = {G, T) can be transformed into another grammar B' = (G', T) using a given 
string graph rewrite rule SR = L ^ i?, if (1) there exists a production 

p = X —>■ (D, G) in G, such that L can be matched into a final subgraph of D 
(2) G' has the same productions as G, except for p which has been modified to 
p' •= {D\ C) where D' is the result of applying the rewriting rule SR to the 
matching of (1). 

Theorem 5. Given a B-ESG grammar B, the pair {B,B') is a B-ESG rewrite 
pattern, where B' is the result of a B-ESG transformation step applied to B. 

Proof. Let B' = {G',T). Assume that the modified production is p = W —>■ 
{D, G) of B = (G, T) and the corresponding production in G' is p' = A —>■ 
{D\G). Rewriting D D' only affects a final subgraph S of D and thus it 
cannot establish any edges between non-terminals or modify the connection in¬ 
structions. Therefore, G' is a B-edNGE grammar and conditions N2, W2, W3, W4 
and NT are satisfied. Gonditions W1 and N1 are also preserved, because we 
are replacing a string graph S with another string graph and the edges between 
vertices in D\S and S are preserved. Thus, B' is a B-ESG grammar. Finally, 
condition 10 is satisfied, because string graph rewriting does not create or re¬ 
move inputs/outputs. □ 

Theorem 6. Let R be a B-ESG grammar, let SR be a string graph rewrite 
rule and let {B,B') be the B-ESG rewrite pattern induced by B and SR as per 
Theorem]^ Then, {B,B') is admisible in the sense that any pattern instantiation 
S F together with S F' are such that the string graph F' can be 

obtained from F via repeated applications of SR. 


Proof. Let p be the production of B which is modified by SR and let p' be the 
result of the modification. Because the modification is done on a final subgraph 
S of p, as p is expanded, a copy of S is created which is not adjacent to any 
previously established vertices. Moreover, the copy S is not adjacent to any non¬ 
terminals and thus its neighbourhood won’t change in the following sentential 
forms. Therefore, to obtain F' from F, we have to apply the rule SR exactly n 
times, where n is the number of times the production p appears in the pattern 
instantiation. □ 

Corollary 1. If B is a B-ESG grammar and {Bi , i32) is a B-ESG rewrite pattern 
with Bi match-exhaustive, then we can enumerate the pattern instantiations of 
{Bi,B 2 ) which induce an admisible rewrite pattern {B,B'). 

Proof. From Theorem]^ any pattern instantiation of (i?i, B^) is a string graph 
rewrite rule. If such a string graph rewrite rule matches a final subgraph in some 
production of B, then it induces an admisible rewrite pattern (B, B') as shown in 
Theorem Finally, by Theorem we can enumerate all of those instantiations 
and thus all such B'. □ 


6 Conclusion and future work 


In this paper, we have defined a family of grammars that is suitable for describing 
languages of string graphs and string graph rewrite rules. We have also showed 
that these languages admit nice decidability properties. There are two natural 
directions in which to extend this work. Firstly, the conditions of a B-ESG 
grammar are sufficient, but not necessary to obtain a language consisting of only 
string graphs. One might ask if there exist other natural and easily-decidable 
conditions yielding such languages, and how those conditions relate to the B- 
ESG conditions. 

The second, and more compelling direction for future work is the development of 
tools for reasoning with B-ESG rewrite patterns and, most importantly, deriving 
new patterns. We took the first step in this direction in Section where we 
described a fairly limited technique whereby grammars can be transformed by 
using string graph rewrite rules to rewrite the final parts of some productions. 
However, in the case string graph patterns based on !-boxes, which we briefly 
discussed in the introduction, we additionally have the ability to do geniune 
pattern-on-pattern rewriting, and derive new !-box rules using !-box induction. 
We expect both of these techniques to extend to the context-free case. Thus, 
we would ultimately like a much richer notion of rewriting, whereby B-ESG 
patterns can be used to rewrite non-final parts of grammars, where e.g. non¬ 
terminals are allowed to match on non-terminals, subject to suitable consistency 
conditions. Similarly, we intend to extend !-box induction to a more general 
structural induction principle that can be used to derive B-ESG rewrite patterns 
from basic string graph rules. This will provide a method for deriving infinite 


families of rules which is both extremely powerful and capable of producing 
machine-checkable diagrammatic proofs. 
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